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Abstract 

We study the optimal use of three-point statistics in the analysis of weak lens- 
ing by large-scale structure. The three-point statistics statistics have long been 
advocated as a powerful tool to break measured degeneracies between cosmolog- 
ical parameters. Using ray-tracing simulations, incorporating important survey 
features such as a realistic depth-dependent redshift distribution, we find that 
a joint two- and three-point correlation function analysis is a much stronger 
probe of cosmology than the skewness statistic. We compare different observing 
strategies, showing that for a limited survey time there is an optimal depth for 
the measurement of three-point statistics, which balances statistical noise and 
cosmic variance against signal amplitude. We find that the chosen CFHTLS 
observing strategy was optimal and forecast that a joint two- and three-point 
analysis of the completed CFHTLS- Wide will constrain the amplitude of the 
matter power spectrum erg to 10% and the matter density parameter Qm to 
17%, a factor of 2.5 improvement on the two-point analysis alone. Our er- 
ror analysis includes all non-Gaussian terms, finding that the coupling between 
cosmic variance and shot noise is a non-negligible contribution which should be 
included in any future analytical error calculations. 
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1. Introduction 



Weak gravitational lensing by large scale structure is a unique tool to probe 
the matter distribution of the Universe regardless of its dynamical state. When 
combined with redshift information weak lensing can be used as a probe for dark 
energy evolution as the expansion of the Universe affects the mass clustering at 
different redshifts. Dark energy constraints from weak lensing rely on accurate 
measurements of dark matter power spectrum amplitude. The two-point cosmic 
shear statistics offer a powerful technique to measure the matter normalization 
parameter era and the mass density parameter f2m combined (see for example 
the recent work on the CFHTLS by [H and Q])- One of the important goals for 
better determination of the cosmological parameters is to improve the individual 
measurement of as and flm- Better estimation of erg and flm allows for the 
alleviation of the residual parameter degeneracies jS]. A noticeable example is 
the neutrino mass 

Three-point shear statistics, in particular the skewness, are in principle pow- 
erful estimators to break the degeneracy between ag and l^m ([1] and p). Some 
detections have been reported from the VIRMOS survey (Q and Q) and from 
the CTIO survey Unfortunately, the signal-to- noise ratio remains low and 
there are still no reliable forecasts of three-point statistics which take into the 
account realistic galaxy number counts and shape noise as well as non-Gaussian 
contributions in the cosmic variance. Therefore, the interpretation of the mea- 
surement is currently not well-established. In [lO| the authors concluded that 
the three-point statistics of lensing signal is greatly enhanced at small angular 
scales because of the non-linear gravitational clustering, but they did not pro- 
vide an estimate of the signal-to-noise ratio for different survey depths. In 
it was shown that the skewness of the convergence can be measured from mass 
maps reconstructed from the shear measured on individual galaxies, however, 
a realistic population of source galaxies was not considered, and the simula- 
tions were limited to second order perturbation theory. jTl| showed that one 
can learn additional information by combining the two- and three-point statis- 
tics, but again neither a realistic source galaxy distribution nor different survey 
strategies were considered. 12] also showed that combining the power spectrum 
and bispectrum tomography information enhances the accuracy of cosmological 
parameter estimations. 

In this paper we investigate the optimal use of three-point statistics in a 
weak lensing analysis of large scale structure, considering several new aspects 
that have been neglected in previous works: 

• A realistic noise contribution using ray-tracing simulations calibrated on 
existing surveys is included. 

• Realistic forecasting for the two- and three-point statistics for different 
survey strategies is provided. 

• For a fixed observing time, wide-shallow and narrow-deep strategies are 
considered. The impact of the survey's depth on both the galaxy num- 
ber density and the source redshift distribution is quantified. Surveys 
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with different characteristics are affected differently by cosmic variance, 
with wider surveys probing a much larger number of modes than nar- 
row surveys. Here we carefully investigate this aspect by comparing the 
performance of various simulated surveys which use a realistic source dis- 
tribution. 

• The source distribution has been derived using galaxy counting as a func- 
tion of redshift as measured in real data for a fixed limiting magnitude. 

• The full likelihood analysis with covariance matrices are computed from a 
large set of ray-tracing simulations. It is therefore an extension of previous 
works which used fisher matrices to gauge the performance of weak lensing 
surveys (e.g. (ist). 



Following 1J| a comparison of different smoothing filters is included. 



A range of most optimal smoothing scales are found by investigating the 
various contributions of noise and signal to the full covariance matrix. 

The best survey strategy for detecting the skewness of the convergence S3 
as means of breaking the degeneracy between ilm and erg is studied. The 
idea first emerged in and Q but its feasibility never quantified. 

The efficiency of combining the two- and three-point statistics is quanti- 
fied. 



• Two- and three-point statistics forecasts for the completed CFHTLS sur- 
vey and the KIDS survey are calculated. 

The paper is organised as follows. In fj2l we summarize the background 
theory of the two- and three-point statistics of the convergence field, where 
notations and definitions are also introduced. The details of the method are 
described in Optimal survey strategies are shown in 3H and ^ shows the 
predictions of two- and three-point measurements of the simulated complete 
CFHTLS- Wide survey area and depth. The upcoming Kilo Degree Survey 
geometry is also discussed here as an example of the accuracy achievable on the 
measurement of the two- and three-point statistics in the near future. Finally 
in blithe conclusions of this study are stated. 

2. Theory Background 

Following [15] and [Ti| we can write the convergence k at a given sky position 
6 as : 



n{9) = ^^^m u;{z)6U,0)dx (1) 
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where x is the angular comoving distance, is the mass density parameter at 
the present day, 5 is the matter density contrast and w(z) for a given redshift z 
is given by: 



uj{z) 



{l + z)x{z) / n{zs) 



1 



X{zs) 



dzs 



(2) 



w(z) depends on the cosmofogical parameters and the galaxy source distri- 
bution function n{zs). The convergence maps are obtained from ray-tracing 
simulations, as described in Section [3. II 

Note that this analysis employs the convergence field k, which is proportional 
to the projected mass density. The convergence can be obtained from the shear 
data 7 = (71,72) either by appropriate weighting with an aperture filter, or 
from mass reconstruction with e.g. a top-hat or Gaussian filter. Therefore, 
the conclusions of this study apply to the convergence and the shear without 
distinction. We are interested in the measurement of (k^), (k^) and the skewness 
5*3 (k) defined as: 



(3) 



Skewness is essentially a measure of the clustering of the mass distribution as 
defined in 3| . According to perturbation theory S'3 provides a measurement of 
r^m independent of the normalization of power spectrum erg. For this reason, the 
skewness of the convergence appears as a very attractive probe of cosmology and 
a useful technique to break degeneracies among other cosmological parameters. 



3. Analysis Method 

3.1. Ray-tracing Simulations 

This analysis is based on a set of simulated K-maps, with 60 lines of sight each 
containing 40 redshift slices from z = 0.020 to z = 3.131. These are generated 
from 22 independent N-body simulations by randomization. As a result, the 
different lines of sight are not totally independent on large scales. However 
they can still be considered approximately independent on scales smaller than 
1 degree. 

The Multiple Lens-Plane ray-tracing approximation method was used to gen- 
erate the lensing convergence map: the dark matter distribution in the universe 
is approximated by a series of mass sheets. The N-body simulations are on a 
grid of 1728"^ points with 856'^ particles, and the box size is 120 /i^^Mpc. The 
mass density in the simulation box is projected to the mid-plane at a series of 
characteristic redshifts. The output redshifts are picked so that the consecutive 
time slices can represent the continuous evolution of the large scale structure. 
The three orthogonal axes of the box are x, y, and z. For every output red- 
shift, we make three projection sheets, parallel to the xy, yz and xz planes. 
We choose one projection sheet out of the three of one N-body simulation in a 
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Resolution: 0.21'/pixel 



Figure 1: A schematic of the simulated convergence maps at different redshift sHces.The maps 
arc on 1024^ with 0.21 arcmin per pixel. The redshift ranges from z=0.020 to z=131. 

random order, as well as randomly shifting the sheet transverse to the projec- 
tion direction. This technique is employed to avoid creating periodicity in the 
projection. Rays are shot through these mass sheets. We calculate k on every 
sheet and project them along lines of sight after the random shift and rotation. 

The maps are on 1024x1024 grid with spacing of 0.21 arcmin. Thus the 
total area is about 12.84 deg^ for each line of sight. We assume the following 
cosmology: Acdm with ilm = 0.24, Ha = 0.76 and as = 0.74 [l3|. Figure [T] 
shows a schematic of the different redshift slices which were combined for each 
line of sight. 

The N-body simulations are generated by the CUBEPM code, which is the 
successor of PMFAST pjj. CUBEPM is MPI parallehzed particle-mesh (PM) 
code, and has particle-particle force implement at sub-grid scales. It is further 
parallelized by shared-memory OpenMP on each node. The simulation volume 
(which is also called simulation box) is cubically decomposed to sub- volumes, 
and the calculation of each sub- volume is performed on one node of the cluster. 
The total number of nodes used in simulation is with n = 3 here. The code 
can be run on up to 1000 nodes. The simulations are run on the Sunnyvale 
cluster of CITA. 

For each of the redshift slices the average (k^), (k^) and S3 are measured and 
the signal is compared with a theoretical model. The two-point cosmological 
predictions are based on the Peacock and Dodds [l3| non- linear fit, whereas the 
thr ee-p oint shear statistics predictions use the bisgectrum non-linear fit derived 
in [20| and implemented for lensing studies in [10|]. The excellent agreement 
between the measured and the predicted signal can be seen in figure [2] where 
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Figure 2: The agreement between the measurements and the theoretical predictions for three 
individual redshift slices. The low redshift slice is at 0.186, the medium slice at 0.668 and 
the high redshift slice is at 2=2.690. The blue lines show the measurements on the simulated 
12.84 deg'^ data, and the pink lines show the theoretical prediction for the same cosmological 
model at the same redshift. The measurements in each panel are made from data smoothed 
with a top-hat filter. The errorbars represent the cosmic variance over 60 lines of sight. 



the results for low, intermediate and high redshift slices are shown. 



3.2. Galaxy Number Density and Redshift Distribution 

In this paper we compare different survey strategies with varying source 
redshift distribution that is dependent on the survey depth. We calibrate the 
redshift distribution from existing optical surveys with photometric redshift 
information and populate the ray-tracing slices accordingly. The focus here is 
on ground based surveys, but the result can be straightforwardly extended to 
space data with an appropriate scaling of the shot noise (which directly depends 
on the galaxy shape noise and number density). 

The galaxy number density and redshift distribution as a function of limiting 
magnitude are estimated from the CFHTLS-Deep survey catalogue in i-band 
2l|. To model the galaxy redshift distribution n{z) for surveys of different 
magnitude limit mjim, the method described in and [2^ was employed, 
modeling rt(z,miim) as: 



n(z,miim) = 



/3 



.or {^) 



^;o(mii„i) 



exp 



(4) 
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Figure 3; The fit to the normalized galaxy number density from the CFHTLS-Deep survey 
catalogue [2l| . The black line shows the histogram of the galaxy number density and the dark 
blue line is the fitted curve. The limiting magnitude m]i,jj =24.5 with the fitting formula given 
by equation (|4]|. Here a, /3 and zq are 0.96, 1.70 and 1.07 respectively. 



The best parametric fit to equation ^ for limiting magnitude i =24.5 cor- 
responds to a=0.96, /3=1.70 and zo=1.07. Figure [3] shows the histogram of 
the normahzed galaxy redshift distribution from the CFHTLS-Deep survey cat- 



alogue [2l| at miim =24.5 and the best fit n{z) from equation Table [T] 



summarizes the values of a, (3, zg and the median redshift Zmcd for the other 
magnitude cuts used in this paper. Equation Q yields a realistic source redshift 
distribution for a given survey's depth [23|. Thus, the theoretical predictions 
built based on the appropriate form of equation (j4]) match the K-maps weighted 
by the galaxy number density derived from the CFHTLS-deep catalogues. 



miim 


a 


/3 


zq 




22.5 


0.76 


6.85 


1.05 


0.68 


23.0 


0.71 


5.30 


1.14 


0.72 


23.5 


0.81 


3.15 


1.19 


0.80 


24.0 


0.80 


2.72 


1.26 


0.84 


24.5 


0.96 


1.70 


1.07 


0.91 


25.0 


0.85 


1.90 


1.26 


0.96 


25.5 


1.46 


1.30 


0.75 


1.02 


26.0 


1.71 


1.27 


0.68 


1.04 



Table 1: The best fit values of a, /3 and zq corresponding to equation l|4]l for several i-band 
limiting magnitudes. These parameters were used to generate theoretical models for Section 
|4]to determine the best survey strategy. The last column contains the median redshift Zmcd 
for each magnitude cut. 
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3.3. Statistical Noise 

The source of shot noise in weak lensing studies depends on the intrinsic 
elhpticity characterized by the r.m.s. <t^ and by the number density of galaxies 
Ug. It was shown in 24] that the noise in a smoothed convergence map can 
simply be derived from the intrinsic elhpticity noise and the galaxy number 
density. In particular, it was shown that the noise in a pixelated smoothed 
K map is simply given by a smoothed uncorrelated Gaussian noise with r.m.s. 
CTe. If Ug denotes the number density of galaxies and W{0) the 2-dimensional 
smoothing function, then the correlation function of the convergence noise is: 



m^n{e')) - 



1 



2 Q'^Ua 



dke 



ik.(6/-0') 



#(k) 



(5) 



where K,n{0) is the convergence noise map and VF(k) is the Fourier transform of 
the smoothing window W{6). 8 is the pixel size, so Q^Ug is the average number 
of galaxies per pixel. 

The galaxy elhpticity r.m.s. measured on CFHTLS-deep data is = {al^ + 
(7^2 ) = 0.44. For the purpose of this paper we will assume that tXe is constant 
as a function of redshift and galaxy type. The convergence noise variance per 
pixel (before smoothing with W) is therefore given by: 



1 



2 



(6) 



Note that the noise model considered here implicitly assumes sources galaxies 
are distributed randomly in each redshift slice. By construction, this choice 
ignores any potential effect caused by source clustering, which is known to be a 
source of contamination for three-point statistics ([2^ and [2^). 

3.4^. Smoothing Filters 

Convergence statistics can be measured from smoothed K-maps (which can 
be obtained from smoothed shear maps from the data) . Various statistics can be 
build by using different smoothing filters. Following the widely accepted choice 
the top-hat, and two types of the compensated (the total area under the filter 
window is equal to zero) filters were considered. The two compensated filters 
used were the ones introduced in (27| (hereby referred to as the aperture filter) 
and in Q which hereby referred to as the Compensated Gaussian (CG). They 
are defined as: 



Wap(0) 



1 



29 f 



exp 



26 f 



if 9 < 9f zero otherwise 
(7) 



where 9f is the characteristic size of the filter. Figure |4] shows the excellent 
agreement between the ray-tracing simulation and the predictions for differ- 
ent smoothing filters. The measurements are based on a realistic redshift 
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Figure 4: The agreement between the measurements and the theoretical predictions based on 
the fitted galaxy population. The blue lines show the measurements on the simulated 12.84 
deg^ data, and the pink lines show the theoretical prediction for the same cosmological model 
and the full redshift distribution. The measurements in each panel are performed on smoothed 
data, with top-hat, aperture mass and Compensated Gaussian filters in order. The errorbars 
include both cosmic variance and statistical noise resulting from ng= 22 galaxies per arcmin^. 



distribution corresponding to a ground based survey with limiting magnitude 
'Tiiim = 24.5 with ng=22 galaxies per arcmin^. The error bars reflect the sta- 
tistical noise and cosmic variance for a 12.84 deg^ survey. From equation ([7]) 
the smoothing scale for the two filters are related as Oqq — 0ap/2V2, therefore 
the maximum smoothing scale chosen for the Compensated Gaussian filter is 25 
arcminutes compared to 84 arcminutes for the top-hat and aperture filters. 



3. 5. Estimate of the Covariance Matrix 

Cosmological parameter forecasting requires the estimate of the covariance 
matrix. Semi analytical methods are available in the literature ([3l[ and (s^l. 
but rely on the assumption of Gaussian statistics. An extension to the non- 



linear angular scales has been recently developed ([33|, [3j| and |35[), however 



the three-point statistics and source redshift distribution and shape noise of 
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realistic surveys were not considered. In this work the full covariance matrix 



C was estimated directly from the ray-tracing simulation as in [33| and 34l |. 
by taking into account the realistic characteristics of lensing surveys described 
in the previous sections. For each survey strategy, the total covariance matrix 
was calculated as follows: for each noise-free k line of sight, the redshift slices 
were combined and weighted according to equation ([1]) with the corresponding 
redshift distribution and galaxy number density. A noise map was then added 
following the method described in Section [331 Finally, the two- and three-point 
statistics were measured over 20 smoothing scales. The covariance matrix of 
the statistic x measured at two smoothing scales 9i and 9j is defined as: 

cie,,0,) EE {{x{e.,) - ^m){x{e,) - ^i{e,))) (8) 

where x is here either (k^), {tc') or and /i is the average calculated from the 
entire simulation set. 

It was shown by [36| that the inverse of the covariance matrix estimated 
from a finite number of ray-tracing simulations is biased. The authors derived 
a simple formula to correct for this effect which relates the number of scales p 
used in the two- (or three-) point statistics and the number n of lines-of-sights. 
The covariance matrix simply has to be replaced by a*C, where a* when the 
mean is determined from the data, is given by: 

a* = ^ (9) 

[3^ showed that this correction is applicable only when n—2 exceeds the number 
of scales p, otherwise the covariance matrix C is not invertible. In this paper n = 
60 simulations were used, and the statistics were measured over p = 20 angular 
scales for top-hat and aperture filter and p — lA for the Compensated Gaussian 
filter. The values of a* for these filters were then 1.55 and 1.28 respectively. 
For joint likelihood calculations the joint covariance matrices were rescaled by 
a* = 3.28 for top-hat and aperture filters and a* = 1.97 for the Compensated 
Gaussian filter. 

Because of the limited area covered by the simulations it is not possible 
to compute the covariance matrices for very large surveys. Fortunately, the 
angular scales where the non-linear effects are important (typically less than 
half a degree for the two- and three- point statistics) are much smaller than 
the 12.84 deg^ field-of-view of a simulation field. Those are also the scales 
where the lensing signal is best measured. Therefore the covariance matrices 
can be computed in the non-linear regime from the different realizations, and 
simply rescaled according to survey size for surveys exceeding the simulation 
box. This is an excellent approximation for angular scales much smaller than 
the simulation box, which was the case in our study since the largest scale used 
to measure the statistics was 84 arcminutes, which is much smaller than the 
dimension of 3.5 x 3.5 degrees of the simulations box. Figure [5] illustrates the 
scaling applied to some elements of the two- and three-point statistic covariance 
matrices. 
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Figure 5: The Cij elements of the covariance matrix as a function of the ratio of the original 
K-map simulation area (12.84 deg2) over the survey area. The solid colored lines are the Cij 
elements from the simulated maps and the black dotted lines are the straight line fit to each 
of them. The left figure shows the Cij of {k?) and the right one is the same for {k^). Here 
the covariance matrix contains only the cosmic variance. The scales are as following: i=l is 
0.42', i=5 is 1.26', i=10 is 4.20', i=15 is 31.5' and i=20 is 84.0'. It shows that the change 
in the covariance matrix of the cosmic variance is inversely proportional to the survey area. 
Hence this result was used to rescale the covariance matrices in the likelihood calculation to 
the desired survey area. 



In order to verify that the covariance matrix computed using ray-tracing 
simulations converges to the one computed in the Gaussian approximation for 
large angular scales, the following procedure was performed: Gaussian realiza- 
tions of the field k, were generated and then the covariance matrix was calculated 
in the same way as the ray-tracing simulations. [2^ and [s^l described a simple 
way to generate cosmological Gaussian fields by convolving white-noise with a 
filter whose transfer function is given by the square root of the power spectrum. 
The power spectrum was directly computed from the sample of ray-tracing sim- 
ulations, so that the resulting Gaussian fields have the same cosmology. Using 
this method, 60 lines of sight were generated and the covariance matrix of the 
Gaussian fields was computed as described by equation Q . Figure [5] shows the 
ratio of the non-Gaussian to Gaussian errors (i.e. the square root of the diagonal 
elements of the covariance matrix) for the two- and three-point statistics of the 
top-hat and Compensated Gaussian filters. It can be seen that for large scales 
the ratio converges to unity as expected. At small scales this ratio is larger than 
the unity due to the non-linear evolution of matter fluctuations. Moreover, for 
a given angular scale, the ratio between non-Gaussian and Gaussian errors is 
larger when one uses the Compensated Gaussian filter than when the top-hat 
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Figure 6: The ratio of Gaussian to non-Gaussian error estimated for convergence k two- and 
three-point statistics. On large scales the non-Gaussian errors estimated from the ray-tracing 
simulations converge to the Gaussian limit. The results for the top-hat and Compensated 
Gaussian filters are shown in left and right column respectively. 



filter is used. The reason lies in the fact that for a given characteristic scale, 
the Compensated Gaussian filter peaks at smaller scales than the top-hat filter. 
The ratio between non-Gaussian and Gaussian covariance depends on the av- 
erage redshift of the survey and for this test a distribution characterized by an 
average redshift z lA was used. The ratio would have been much higher if a 
much shallower survey was chosen. 



4. Survey Design and Observing Strategy 

4-1- Optimal smoothing scale 

The covariance matrix contains three terms (fsij) : 

C = Css + Cn, + Cnn (10) 

where Css is the pure signal (i.e. noise free) cosmic variance, Cnn is the pure 
noise covariance and Cns is the cross-correlation term. The goal in this section 
is to determine at which angular scale the measurement of the two- and three- 
point shear statistics has a better signal-to-noise ratio. For this purpose the 



12 



covariance matrix was separated into the three terms introduced above and their 
amphtudes for different filters were expfored. Practically Css can be calculated 
from the noise free ray-tracing realizations and Cnn from noise-only convergence 
maps. Among the three parts of the covariance matrix: Css, Cnn, and Cns, the 
mixed term Cns is the most computationally expensive to calculate. The reason 
is that the noise contribution to the covariance matrix converges more slowly 
than the cosmic variance contribution, and in practice, it is necessary to estimate 
the noise from more than 60 noise realizations. For the two-point statistics 
there are analytical formulae in 31| , but there is currently no equivalent for the 
three-point statistic and the skewness of the convergence. In order to inspect 
the three different terms, the covariance matrix was calculated as follows: for 
each noise realization, a C was calculated, which was relatively noisy because 
it was obtained from one noise pattern. Then the average of C was taken over 
ten noise realizations. The covariance matrix thus obtained was specialized 
for a given noise statistical property, and the whole calculation was repeated 
each time the observing conditions affecting the noise were changed. Cnn was 
calculated separately over ten thousand realizations. The average Css and Cnn 
were used to determine the cross term Cns- Because of the averaging process 
one obtains a covariance matrix which has a relatively small noise making C 
invertible. To illustrate the contribution of each of these parts the diagonal 
elements of the Css, Cnn, Cns and C were extracted as the noise term for each 
smoothing scale. 

Figure [7] shows the relative contribution of different terms in the covariance 
matrix. The noise-to-signal ratio for the individual components of the covariance 
matrix are shown. The blue (long dashed) line is the signal-signal which is the 
result of the cosmic variance only. The noise-noise term is shown with black 
(short dashed) line. The mixed term was derived from C — Cgs — Cnn and is 
shown in red (dash-dotted) line and the green (solid) line shows the total noise 
over signal ratio. As expected the finding was that small scales were dominated 
by statistical noise and the large scales by cosmic variance, where the signal 
is low. Interestingly, the mixed noise term is relatively negligible for the two- 
and three-point statistics, although it is clear that future high precision surveys 
will have to take it into account. Surprisingly, the mixed noise term is strongly 
dominant for the skewness. 

In agreement with [3] a range of optimal angular scales (between one ar- 
cminute and half a degree) was found for which the total noise affecting the 
two-point shear statistics is minimal, this is also the case for the three-point 
statistics. 



4-2. Wide and Shallow versus Deep and Narrow 

Many of the future lensing surveys will have a limited observing time and a 
full sky coverage will not be possible. The question will arise whether a deep 
and narrow survey performs better than a large and shallow survey. Therefore, 
it is important to quantify what is the optimal balance between survey size and 
depth, given a fixed observing time. It is expected that very shallow surveys 
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Figure 7: The noise-to-signal ratio for the cosmic variance only in blue (long dashed) line, 
statistical noise only in black (short dashed) line, the mixed term in red (dash-dotted) line 
and the total noise in green (solid). The (k'^), (k^) and S3 measurements were calculated 
for a simulated 12.84 deg^ data smoothed with top-hat, aperture and Compensated Gaussian 
(from top to bottom). 
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Area(deg^) 1400 1150 900 514 257 115 45 ~20 



miim 



22.5 



23.0 



23.5 24.0 24.5 25.0 25.5 26.0 



ng/arcmin"' 2 



5 



9 14 22 28 37 45 



GF 1 



1.8 



4.0 



5.0 2.5 2.4 1.5 1.3 



Table 2: The area and i-band limiting magnitude and the corresponding galaxy number 
density of different surveys with the same observing time. The gain factor GF is the ratio 
between the Qrn If width of the two-point statistics contours over that of the two- and three- 
point statistics joint contour. (*) Due to the truncated likelihood la contours the GF is not 
calculated for the deepest survey. 

would provide a poor weak lensing measurement due to the small lensing effi- 
ciency for nearby sources, and deep-narrow surveys will be limited by cosmic 
variance. The trade-off between those radically different survey designs must 
include a proper estimate of the amplitude of the lensing signal and shot noise 
as function of survey depth. 

The relation between limiting magnitude and surve y a rea for a fixed observ- 
ing time was derived from the algorithm developed in [37| . The galaxy number 
density was obtained by selecting galaxies which signal-to-noise detection level 
was larger than 7 and which are also well resolved for weak lensing studies fol- 
lowing the criteria given in Section 13.21 Table [2] shows the survey area and 
limiting magnitude for each case investigated here. 

The likelihood is given by: 



where d is the data and m is the theoretical model. C^^ is the inverse covariance 
matrix over all lines of sight. As described in the previous section the covariance 
matrix was computed directly by using the simulations and its inverse had been 
re-calibrated using equation The likelihood contours were performed in the 
f^m-CTs parameter space, rim was varied between 0.1 and 1.0 with 0.05 intervals 
and CTs values were between 0.50 and 1.50 with 0.05 intervals. 

Figure [8] shows the pink (dark gray) contours for (k^) and cyan (light grey) 
for (k^) likelihood for top-hat filter. The filled contours show the joint (k^)-(k^) 
likelihood. The (k^) and (k^) contours become more degenerate for deep and 
narrow surveys, whereas for wide and shallower surveys it appears clearly that 
the (k^) and (k^) likelihood contours have a different orientation in the flm-cs 
plane, which explains why the joint analysis works better for wide and shallow 
surveys. One can see indeed that the individual two- and three-point statistics 
contours for the wide and shallow surveys become large again due to a larger 
noise, but the joint analysis remains competitive. This could be attributed to 
the larger sensitivity of the three-point statistics to non-linear effects for shallow 
surveys as a consequence of the projection of mass along the line-of-sight (i.e. 
identical angular scale probes more non-linear scales for shallow rather than 
deep survey). For the joint two- and three-point statistics analysis, the medium 
depth surveys (niiim ==23.5 or 24.0) appear optimal. It is clear that for a fixed 



£ = exp — -(d— m)"^*C ^ * (d — m) 



(11) 
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observing time, our results favor the medium shallow-wide surveys. The gain 
factor GF is defined as the ratio of the la error width of the (k^) contours over 
that of joint {k^)-{k^) measurements which quantifies the improvement when 
the joint statistics is considered. The values of the GF corresponding to the 
likelihood contours of figure [5] are shown in table [H 

Unfortunately, the skewness of the convergence, defined in equation does 
not appear to yield as powerful constraints as the combined two- and three- 
point statistics. Figure [5] shows the error contours using S3 for three choices 
of limiting magnitude and survey area. The observing time here was fixed, like 
for the previous analysis. As expected, the dependence on cg is very weak, but 
one can see that the width of the contours along the ^Im axis is much larger 
than the 0^ constraints one gets from the joint analysis shown in Figure [H 
Following the same trend as joint (k^)-(k^) likelihood results shown in figure [H 
the medium depth surveys lead to the most optimal skewness measurement. The 
constraints for the shallower surveys (i.e. miim =22.5 and 23.0) are not shown 
here. Those surveys give poor cosmological constraints, as the mixed Cns term 
of the covariance at the scales of interest becomes large. Overall, the skewness 
does not appear to be as attractive a statistic to break the as-^m degeneracy 
as previously advocated (^] and [6*]). Measuring the skewness on the current 
and near future lensing surveys will be very challenging, and it is clear that a 
large fraction of the sky is needed in order to bring the noise contributions to a 
low enough level for precision cosmology. 

The reason why the skewness is hard to measure lies in the fact that the 
variation of the skewness amplitude for different flm models is largely absorbed 
by the cosmic variance of this estimator. This is not the case for the two- and 
three-point statistics taken separately. Figure [10] shows the comparison between 
various predicted cosmological models and the measurements from the simula- 
tions, (k^), (k"^) and S3 were measured for survey area of 12.84 deg^ of limiting 
magnitude of 24.5 over the 60 lines of sight. The blue line shows the measured 
data points; the errorbars contain both cosmic variance and statistical noise. 
The pink (soHd) Hue is the fiducial model (rim=0.24, riA=0.76 and as=0.7A). 
The black (dotted), green (dashed) and red (dash-dotted) lines are models with 
the same erg — 0.75 and values of n„i = 0.20,0.40 and 0.80 respectively, while 
the purple (dash-dot-dotted) line corresponds to a model with ri„i = 0.30 but 
as = 0.50. The plot shows that the measurement of (k^) and (k^) are much 
more sensitive to the r^m, as parameters than the skewness 6*3, therefore their 
ability to separate various cosmological models is stronger. 

5. Canada-France-Hawaii Legacy Survey Three-point Statistics Pre- 
dictions 

The Canada- France-Hawaii Telescope Legacy Survey covers 170 deg^ in four 
patches Measurements of the two-point cosmic shear statistics have been 
pubHshed using the first year ([sll, [3§| and [ij) and third year data release 01 
in addition to studies of galactic scale dark matter halos In this study 
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Figure 8: The likelihood analysis for various survey depths and areas with fixed observing time 
for (/t^), (k^) smoothed with the top-hat filter. The observing time is equal for all cases, while 
the survey area and depth vary. Table [2] shows the values for mj;^ with the corresponding 
survey areas. The pink (dark grey) contours indicates the la, 2a and 3cr errors for the (k^) 
statistics and the cyan (light grey) contours are the same for the (k^). The covariance matrix 
contains both the cosmic variance and the statistical noise. Here the joint likelihood shown in 
filled contours is calculated by taking into the account the {k'^)-{k^) correlations at different 
scales. 
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Skewness Tophat 




0.2 0.4 0.5 0.8 1.0 0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0 

Ou n„ 0. 



Figure 9: The likelihood analysis for various survey depths and areas with fixed observing 
time for skewness 53 smoothed with the top-hat filter. The observing time is equal for all 
eases, while the survey area and depth vary. Table [2] shows the values for mii,jj with the 
corresponding survey areas. The covariance matrix contains both the cosmic variance and the 
statistical noise. The skewness measurements are most optimal for shallower surveys. 

the expected improvement for cosmological parameter constraints, using a com- 
bination of two- and three-point lensing statistics on the completed CFHTLS- 
wide survey was determined. A mock CFHTLS-widc survey type of 170 deg^ 
was generated using a hmiting magnitude of mii,„ = 24.5 (i-band) with Ug—Tl 
galaxies per arcmin^, Zmcd=0.91 and a^=0A4 and the potential contamination 
by residual systematics was ignored. 

Figure [TT] shows the flm and (Ts error contours from a joint measurement 
of (k^) and (k'^) for the three filters used in this study. It is clear that the 
top-hat filter leads to a more significant degeneracy breaking between flm and 
fTg, which can be understood by the fact that this filter preserves modes with 
wavelengths larger than the smoothing size, while the aperture filters are not 
sensitive to large scale variations. The joint two- and three-point analysis of 
the completed CFHTLS-Wide will constrain and as to 17% and 10% re- 
spectively. This corresponds to a gain factor (GF) of ~ 2.5 (for and ~ 2.1 
(for fjg) improvement on the two-point analysis alone when the top- hat filter is 
used. It is interesting to compare Figure [TT] to a generalized approach which 
can serve to quantify the performance of the different filters. The generalized 

is defined as: 



— = Vd^^C-Ud (12) 

where C is the covariance matrix of the statistics under consideration. The 
results for a 12.84 deg^ survey and limiting magnitude mii,„ — 24.5 arc shown 
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Figure 10: The comparison between the measured values of {k?), {iC') and 53 and different 
cosmological models over a survey area of 12.84 deg^.The blue line shows the measured data 
points, and the pink (solid) line is the fiducial model. The black (dotted), green (dashed) and 
red (dash-dotted) lines are models with the same (jg = 0.75 and values of = 0.20,0.40 
and 0.80 respectively, while the purple (dash-dot-dotted) line corresponds to a model with 
f2m = 0.30 but iT8 = 0.50. The plots show that the measurement of (k^) and (k^) are much 
more sensitive to the , 

(Tg parameters than the skewness ^3. This is why we can not 
currently constrain the fim-fs plane with skewness measurements. 
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in table [31 and they indicate that for two-point statistics the different filters are 
equivalent. The top- hat filter outperforms the aperture filters for the three-point 
statistics. It is a direct illustration that top-hat preserves small and large scale 
modes, and it is therefore more sensitive to non-linear effects. This invalidates 
the fact that the Compensated Gaussian filter is the most efficient measure of 
the skewness of the convergence [3] . The reason lies in the fact that for a fair 
comparison the maximum smoothing scale for Compensated Gaussian filter may 
not exceed a third of top-hat and aperture smoothing radii. This can be seen by 
looking at the equations which define the shape of the filters (see Section [3^ . 



S/N 






S3 


Top-hat 


6.19 


2.68 


5.45 


Aperture 


6.05 


1.61 


2.17 


Compensated Gaussian 


6.93 


1.88 


3.24 



Table 3: The generalized results for top-hat, aperture and Compensated Gaussian filters. 
The full covariance matrix is that of the 12.84 deg^ maps. The data d is from the K-maps 
smoothed with top-hat, aperture and Compensated Gaussian filters. The correlation between 
the scales are contained in the signal-to-noise ratio. 



The joint {k^)-{k^) likelihood analysis with top-hat, aperture and Compen- 
sated Gaussian filters proved to be promising, whereas the skewness which is in 
principle a very interesting statistic inferred very weak cosmological constraints 
even for the current largest weak lensing survey at 170 deg^. Figure [T^ shows 
skewness likelihood contours obtained using both top-hat and Compensated 
Gaussian filter for CFHTLS-like survey confirming what stated above about 
the poor efhciency of the skewness. 

One of the forthcoming weak lensing surveys is the Kilo Degree Survey 
(KIDS area of 1500 deg^ at miim=23.5). We performed for the KIDS survey the 
same analysis as for the CFHTLS-Wide to forecast the accuracy of the likelihood 
constraints using two- and three-point shear statistics. Moreover, for comparison 
the calculations were repeated for a survey with the same observing time needed 
for the KIDS survey but different total area and depth. The results establish 
which survey design would be the most optimal to infer constraints using the 
joint two- and and three- point shear statistics. The expected likelihood contours 



for the complete KIDS survey are shown in the panel 13(a) whereas panel 13(b) 



shows the same results when a deeper (mii,ii=24.5) and narrower (area=450 
deg^) survey given the same observing time was considered. As expected from 
Figure [8] the shallower KIDS gives better results for the joint likelihood, but 
the skewness would be slightly better measured from the deeper (miini=24.5) 
survey. 



6. Conclusion and Discussion 

We studied how useful the measurement of the two- and three-point shear 
statistics can be to derive cosmological constraints under realistic observing 
conditions. One of the limitations of the previous work on this topic was the 
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Figure 11: The cosmological constraints on Qm-as plane obtained with different smoothing 
filters. The contours show the (k^) and (k^) joint likelihood forecast based on CFHTLS 
completed area. The survey area is 170 deg^ and the limiting magnitude is 24.5 with the full 
redshift distribution. Here the pink (dark grey) contours show the (k^) and the cyan (light 
grey) contours show the (k^) constraints. The filled contours correspond to the lir, 2(T and 
3a errors for the joint likelihood. The fiducial model used is a ACDM with flrn = 0.24 and 
as = 0.74. The degeneracy direction of the (k^) and {k^) likelihood is different (especially 
when the maps are smoothed with Compensated Gaussian filter) so their joint likelihood 
results in a tighter constraints on the parameters. The joint likelihood here is calculated by 
taking into account the cross correlations between (k^) and (k"^) at all scales. 
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Figure 12: The skewness S3 likelihood analysis for the CFHTLS-wide predictions. The 
smoothing filters top-hat and Compensated Gaussian are used. The aperture filter does not 
provide any constraint on the Qm-crs plane for the given survey characteristics. The covariance 
matrix contains both the cosmic variance and the statistical noise. 

disconnection between the source redshift distribution and the survey depth 
under consideration. Here, a set of ray-tracing simulations was populated with 
source galaxies that follow a redshift distribution and galaxy number density 
calibrated from real data. 

We then investigated how well the parameters as and can be measured 
with different smoothing filters for different survey depths. For a fixed observing 
time, the results of the study favored the medium depth and width survey 
over shallower and wider or deeper and narrower surveys. There is an optimal 
survey depth versus size for which the source density (survey depth) and cosmic 
variance (survey area) are balanced, which turns out to be very close to the 
CFHTLS depth. Our results can be applied also to surveys covering a large 
fraction of the sky with no limitation on the observing time (e.g. PanSTARRS 
and LSST) by a simply rescaling the covariance matrices. They can also be 
extended to space data if the amplitude of ellipticity noise and galaxy number 
counts are adjusted according to space observations (this is particularly relevant 
for a wide field space imager like the one by JDEM). 

We find that the lensing statistics are best measured at scales between 1 to 
30 arcminutes, where the contribution of statistical noise, cosmic variance and 
the mixed term are minimal. We also find that the different smoothing filters 
give similar results although the top-hat appears to include more modes and is 
therefore slightly better than the others. Combining (k^) and (k'^) is promising 
to achieve cosmological constraints in the D,m-<^8 parameter space. On the 
contrary the skewness of the convergence does not appear able of breaking the 
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(b) 

Figure 13: The comparison between a KIDS-like survey at two limiting magnitudes. The 
right panels show the likelihood contours of (k^) and (k^) smoothed with the top-hat filter. 
The left panels are the skewness contours. The survey area for the panel [(a)] is 1500 deg^ as 
is planned for the KIDS survey with m]i„i=23.5. In panel [(b)] the observing time is kept the 
same and the survey area if adjusted to 450 deg'^ for miini=24.5. 
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degeneracy between erg and as initially expected (d and 0). The reason is 
that the cosmic variance on 5*3 is comparable to the difference in lensing signal 
amplitude for the different cosmological models of interest. Only very large 
surveys will be able to measure the skewness accurately. 

We forecasted the cosmological constraints for the CFHTLS-wide completed 
survey finding that the combination of two- and three-point functions on the 
CFHTLS will greatly enhance the measurement of cts and 51^. A similar calcu- 
lation showed the potential precision achievable with the future KIDS survey. 

This study has some limitations which will be investigated in future work. 
One of them is the fact that the source galaxies are clustered in three dimensional 
space which overlap with lens redshift distribution (problem known as the source 



clustering problem, [25|). This effect leads to a change in the skewness of the 



convergence (by as much as 25%), and its impact on the three-point statistics 
has not been evaluated yet. A recent study also showed potential impact on 
the two-point statistics, although at a moderate level ([l^). Another limitation 
is the potential impact of intrinsic alignment on three-point statistics. This is 
particularly relevant for shallow surveys such as PanSTARRS or KIDS fjiH). 
This effect should be taken into the account as well. We would be able to 
investigate these two complications with ray-tracing simulations which include 
galaxies in dark matter halos; this can be realized by the use of semi analytical 
models such as the ones described in [291. 
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